Investigation of eight candidate genes on chromosome 1p36 for autosomal dominant total congenital cataract.

PURPOSE
To identify the causative gene for autosomal dominant total congenital cataract in a six-generation Australian family displaying linkage to chromosome 1p36.


METHODS
Eight candidate genes (HSPB7, FBXO42, EFHD2, ZBTB17, CAPZB, FBLIM1, ALDH4A1, and MFAP2) from within the previously defined linkage interval were selected based on expression in lens and their known or putative function. The coding exons were sequenced in multiple affected family members and compared to the reference sequence.


RESULTS
No segregating mutations were identified in any of the eight genes. Thirty-one polymorphisms were detected, 20 of which were in the exons and 11 in the flanking introns.


CONCLUSIONS
Coding mutations in HSPB7, FBXO42, EFHD2, ZBTB17, CAPZB, FBLIM1, ALDH4A1, and MFAP2 do not account for congenital cataract in this family.

Congenital cataract is a relatively rare condition leading to severe visual impairment or blindness in affected children. The disorder is highly heterogeneous with 17 genes reported to date and an additional 16 loci reported for which the gene remains to be identified. The known genes to date include structural proteins (lens crystallins and cytoskeletal proteins), transport molecules (gap junctions and aquaporins), and transcription factors (developmental and stress response). All forms of Mendelian inheritance (autosomal dominant and recessive as well as X-linked) have been reported.
One region of interest in congenital cataract is the short arm of chromosome 1, particularly 1p36. At least two independent loci in three families have been mapped to this region [1][2][3]. In 1995, Eiberg and colleagues [1] mapped the Danish Volkmann-type cataract (OMIM 115665) to a region between the 1p telomere and marker D1S214. Shortly after this in 1997, Ionides and colleagues [2] mapped posterior polar cataract in a three-generation British family to a similar but larger region between the telomere and D1S2845. More recently, complete congenital cataract displaying autosomal dominant inheritance in a six-generation Australian family was localized to a six megabase (Mb) region between markers D1S228 and D1S199, which contains 60 annotated genes [3]. This region overlaps with that of the British family but not the Danish family. Thus, there are at least two congenital cataract loci on 1p36.
This report describes the investigation of eight positional and functional candidate genes for complete congenital cataract in the Australian family previously described ( Figure  1) [3]. The genes were chosen on the basis of lens expression and their known or putative function and were confined within the known linkage interval in our family. They have a variety of functions including stress response, ubiquitination, calcium binding, cytoskeleton association, and metabolism (Table 1).
HSPB7 (heat shock 27 kDa protein 7) is a small heat shock protein that is involved in cellular response to stress. The α-crystallins are members of the small heat shock protein family and are well known to cause autosomal dominant and recessive forms of congenital cataract [4][5][6][7]. FBXO42 (F-box only protein 42) is an E3 ubiquitin ligase, which is involved in the ligation of ubiquitin to target proteins to signal for protein degradation via the proteasome. During lens fiber development, the cells lose their organelles through targeted ubiquitination, and an incomplete degradation of these organelles can result in cataract [8]. The pathway has also been implicated in age-related cataract [9]. EFHD2 (EF hand domain family member D2) is likely a calcium-binding protein. It is expressed in many tissues including lens and fetal eye. Excess calcium in the lens has been shown to cause congenital cataract in rabbits [10], mice [11], and sheep [12]. Calcium activates a family of intracellular proteases known as calpains, some of which are lens-specific [13]. Calpains cleave crystallins causing them to precipitate and aggregate [14].
The cytoskeleton is known to be important in congenital cataract with several mutations reported in the lens specific cytoskeleton protein genes, BFSP1 and BFSP2 [15][16][17].
Several genes in the linkage region are associated with the cytoskeleton. ZBTB17 (zinc-finger and BTB domaincontaining protein 17, also known as myc-interacting protein 1) is a transcription factor that interacts closely with microtubules [18] and activates gene transcription in response to changes in the cytoskeleton and to cell-cell adhesion formation [18,19]. CAPZB is a subunit of CAPZ, an actinbinding protein that regulates growth of the actin filament by capping the barbed end of the growing filament [20]. FBLIM1 (filamin-binding LIM protein 1, also known as migfilin) is a cytoskeleton protein that localizes to cell-cell adhesions, cell-ECM (extracellular matrix) junctions, and the nucleus, where it may also play an important role in transcriptional activation [21]. Links between the actin cytoskeleton and the ECM are important in maintaining tissue integrity and play a part in controlling cell morphology and behavior [21]. ALDH4A1 (aldehyde dehydrogenase 4A1) is a mitochondrial matrix NAD(+)-dependent dehydrogenase, which catalyzes the second step of the proline degradation pathway, converting pyrroline-5-carboxylate to glutamate. Mutations in this gene lead to the recessive disorder hyperprolinemia type II (OMIM 239510), which is characterized by a build up of both proline and pyrroline-5carboxylate in the blood. The accumulation of excess materials in the lens is seen in hereditary hyperferritinemia cataract syndrome (OMIM 600886) and galactosemia (OMIM 230400), where elevated levels of ferritin (an iron storage protein) and galactose respectively lead to cataract. MFAP2 was chosen because of high expression levels in the mouse lenses from embryonic days 10.5-12.5 (data not shown). MFAP2 (microfibril-associated protein 2) is a protein component of elastin microfibrils, which are structural components of the extracellular matrix. Although there is no known direct link between the extracellular matrix and congenital cataract, a defect in the structure of microfibrils may affect the transparency of lens during development.

Approval for the study was given by the Clinical Research Ethics Committees of the Royal Victorian Eye and Ear
Hospital (Melbourne, Australia) and Flinders University (Adelaide, Australia). The family and linkage region on 1p36 has been previously described [3]. A six-generation family with total congenital cataract was ascertained (Figure 1), and DNA was extracted from whole blood or buccal swabs. Linkage analysis at known congenital cataract loci was conducted, and significant linkage was detected on chromosome 1 between markers D1S228 and D1S199. PAX7 (involved in ocular development) was previously excluded as a cause of cataract in this family. DNA from all family members was subjected to whole genome amplification using GenomiPhi (GE Healthcare, Rydalmere, NSW, Australia) according to the manufacturer's protocols to increase the stock of DNA for sequencing candidate genes. Amplified Candidate genes from within the linkage region were chosen on the basis of expression in the human lens or fetal eye either from evidence in publicly available databases    in this study in 77 unrelated controls. The reference allele and genotypes for family members sequenced are also given.
EFHD2, ZBTB17, CAPZB, FBLIM1, ALDH4A1, and MFAP2 (Appendix 1, Figure 2). PAX7 was previously excluded by direct sequencing [3]. Protein coding exons and the intron-exon boundaries of all eight genes were directly sequenced. Multiple polymorphisms were identified in the coding exons and surrounding introns of these candidate genes ( Table 2). No segregating mutations were identified. Of the 31 polymorphisms detected, 20 were in exons. Four of the exon polymorphisms were non-synonymous. All variants were represented in the dbSNP database. Common or intronic SNPs were not investigated further. Frequency information and validation were not available for several reported polymorphisms including the non-synonymous SNP rs35196193 in FBXO42 and the synonymous rs41310410 in ALDH4A1. Based on the individuals sequenced, rs35196193 was excluded due to lack of segregation while segregation of rs41310410 could not be excluded. The minor allele of rs41310410 may introduce a binding site for proteins involved in splicing of the mRNA, specifically the SRp40 protein [22,23]. Such sites are known as exonic splice enhancers and can improve (or may be necessary for) the efficiency with which introns are removed from the mRNA. Thus, this SNP was assessed in all family members for segregation. All affected individuals carried the A allele, and three of these (16.08, 16.10, and 16.11) were homozygous for this allele, suggesting it to be common in the population. To confirm it as a common polymorphism, it was genotyped in 77 unrelated, unaffected individuals, revealing that the A allele occurs at a frequency of 3.2% in the Australian population. One case of homozygosity was detected. This indicates that this SNP is found in the normal population and thus is not likely to be the cause of severe congenital cataract in this family.
From these data, we conclude that none of these eight genes are involved in the development of this severe and highly penetrant congenital cataract, although regulatory mutations have not been ruled out. Of 56 annotated genes in this linkage region, nine (including PAX7) have now been excluded. At least 10 additional genes show lens or fetal eye expression and will form the next group of genes to be assessed in this family.
including UniGene and NEIbank or from the lens microarray database generated by Lachke and Maas (unpublished). Putative or known functions were also considered. Primers (Appendix 1) were designed to amplify the coding exons (including splice sites) of each gene, and PCR products were directly sequenced on the ABI PRISM 3100 Genetic Analyzer (Applied Biosystems, Foster City, CA) with BigDye Terminators (Applied Biosystems) according to standard protocols. Each exon was sequenced in two affected individuals, and sequences were compared to the reference sequence. Due to limited amounts of DNA, different individuals were sequenced for each gene. In some cases, an unaffected descendant was also investigated. Common previously reported variants were not considered a high priority for follow-up in the additional affected or unaffected individuals. A segregating polymorphism with limited information available in public databases (rs41310410) was genotyped in 77 unrelated individuals using the SNaPshot® multiplex kit (Applied Biosystems) to determine its frequency. A primer directly adjacent to the polymorphic nucleotide was annealed to the PCR product, and a single fluorescently tagged ddNTP was incorporated according to the manufacturer's protocols (Applied Biosystems). The products were then electrophoresed on the ABI PRISM 3100 Genetic Analyzer to detect the alleles. The potential of this synonymous single nucleotide polymorphism (SNP) to affect splicing was investigated in silico using the ESEfinder [22,23]

RESULTS AND DISCUSSION
Annotated genes with a function or putative function that could be linked to cataract formation and a known expression in human lens or fetal eye were chosen for sequencing. The fetal eye was considered to be an appropriate tissue due to the congenital (and thus developmental) nature of the phenotype in this family. The genes chosen were HSPB7, FBXO42,